Statistics with R Capstone: Univariate Analysis

n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 686 1 1477 553.4 831.8 898.9 1092.0 1411.0 1743.2 2127.6 2450.7
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 535 1 181190 84086 85495 104860 129763 159467 213000 285000 345700
20 30 40 45 50 60 70 75 80 85 90 120 160 180 190 Total
Frequency 379 49 1 7 93 195 34 6 39 21 35 69 46 7 19 1000
Proportion 0.379 0.049 0.001 0.007 0.093 0.195 0.034 0.006 0.039 0.021 0.035 0.069 0.046 0.007 0.019 1.000
C (all) FV I (all) RH RL RM Total
Frequency 9 56 1 7 772 155 1000
Proportion 0.009 0.056 0.001 0.007 0.772 0.155 1.000
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
833 167 104 0.998 69.21 24.94 32.0 42.2 57.0 69.0 80.0 95.0 108.0
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 785 1 10352 5671 3228 4764 7314 9317 11650 14141 16694
Grvl Pave Total
Frequency 3 997 1000
Proportion 0.003 0.997 1.000
Grvl Pave Total
Frequency 33 34 67
Proportion 0.033 0.034 0.067
Reg IR1 IR2 IR3 Total
Frequency 629 338 30 3 1000
Proportion 0.629 0.338 0.030 0.003 1.000
Bnk HLS Low Lvl Total
Frequency 33 38 20 909 1000
Proportion 0.033 0.038 0.020 0.909 1.000
AllPub Total
Frequency 1000 1000
Proportion 1 1
Corner CulDSac FR2 FR3 Inside Total
Frequency 173 76 36 5 710 1000
Proportion 0.173 0.076 0.036 0.005 0.710 1.000
Gtl Mod Sev Total
Frequency 962 33 5 1000
Proportion 0.962 0.033 0.005 1.000
Blmngtn Blueste BrDale BrkSide ClearCr CollgCr Crawfor Edwards Gilbert Greens GrnHill IDOTRR MeadowV Mitchel NAmes NoRidge NPkVill NridgHt NWAmes OldTown Sawyer SawyerW Somerst StoneBr SWISU Timber Veenker Total
Frequency 11 3 10 41 13 85 29 60 49 4 2 35 16 44 155 28 4 57 41 71 61 46 74 20 12 19 10 1000
Proportion 0.011 0.003 0.010 0.041 0.013 0.085 0.029 0.060 0.049 0.004 0.002 0.035 0.016 0.044 0.155 0.028 0.004 0.057 0.041 0.071 0.061 0.046 0.074 0.020 0.012 0.019 0.010 1.000
Artery Feedr Norm PosA PosN RRAe RRAn RRNe RRNn Total
Frequency 23 53 875 8 11 11 14 2 3 1000
Proportion 0.023 0.053 0.875 0.008 0.011 0.011 0.014 0.002 0.003 1.000
Artery Feedr Norm PosA PosN RRNn Total
Frequency 2 6 988 1 2 1 1000
Proportion 0.002 0.006 0.988 0.001 0.002 0.001 1.000
1Fam 2fmCon Duplex Twnhs TwnhsE Total
Frequency 823 20 35 38 84 1000
Proportion 0.823 0.020 0.035 0.038 0.084 1.000
1.5Fin 1.5Unf 1Story 2.5Unf 2Story SFoyer SLvl Total
Frequency 98 8 521 10 286 36 41 1000
Proportion 0.098 0.008 0.521 0.010 0.286 0.036 0.041 1.000
Very Excellent Excellent Very Good Good Above Average Average Below Average Fair Poor Very Poor Total
Frequency 9 40 122 200 238 305 68 9 8 1 1000
Proportion 0.009 0.040 0.122 0.200 0.238 0.305 0.068 0.009 0.008 0.001 1.000
Excellent Very Good Good Above Average Average Below Average Fair Poor Very Poor Total
Frequency 12 47 131 193 561 36 14 3 3 1000
Proportion 0.012 0.047 0.131 0.193 0.561 0.036 0.014 0.003 0.003 1.000
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 102 1 1972 33.14 1919 1925 1955 1975 2001 2006 2007
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 61 0.998 1984 23.1 1950 1950 1966 1992 2004 2007 2007
Flat Gable Gambrel Hip Mansard Total
Frequency 9 775 8 204 4 1000
Proportion 0.009 0.775 0.008 0.204 0.004 1.000
CompShg Metal Tar&Grv WdShake WdShngl Total
Frequency 984 1 11 2 2 1000
Proportion 0.984 0.001 0.011 0.002 0.002 1.000
AsbShng BrkComm BrkFace CemntBd HdBoard ImStucc MetalSd Plywood Stucco VinylSd Wd Sdng WdShing Total
Frequency 12 1 36 40 164 1 147 74 15 349 138 23 1000
Proportion 0.012 0.001 0.036 0.040 0.164 0.001 0.147 0.074 0.015 0.349 0.138 0.023 1.000
AsbShng Brk Cmn BrkFace CBlock CmentBd HdBoard ImStucc MetalSd Plywood Stucco VinylSd Wd Sdng Wd Shng Total
Frequency 10 3 23 1 40 150 8 148 96 14 345 130 32 1000
Proportion 0.010 0.003 0.023 0.001 0.040 0.150 0.008 0.148 0.096 0.014 0.345 0.130 0.032 1.000
BrkCmn BrkFace None Stone Total
Frequency 7 8 317 593 75 1000
Proportion 0.007 0.008 0.317 0.593 0.075 1.000
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
993 7 258 0.787 104.1 159.2 0.0 0.0 0.0 0.0 160.0 336.0 480.4
Excellent Good Average/Typical Fair Total
Frequency 39 337 613 11 1000
Proportion 0.039 0.337 0.613 0.011 1.000
Excellent Good Average/Typical Fair Total
Frequency 4 116 861 19 1000
Proportion 0.004 0.116 0.861 0.019 1.000
BrkTil CBlock PConc Slab Stone Total
Frequency 102 430 453 12 3 1000
Proportion 0.102 0.430 0.453 0.012 0.003 1.000
Excellent Good Typical Fair Poor Total
Frequency 87 424 438 28 1 978
Proportion 0.087 0.424 0.438 0.028 0.001 0.978
Excellent Good Typical Fair Poor Total
Frequency 2 44 908 23 1 978
Proportion 0.002 0.044 0.908 0.023 0.001 0.978
Good Exposure Average Exposure Minimum Exposure No Exposure Total
Frequency 98 157 87 635 977
Proportion 0.098 0.157 0.087 0.635 0.977
Good Living Quarters Average Living Quarters Average Rec Room Below Average Living Quarters Low Quality Unfinished Total
Frequency 294 163 87 107 48 279 978
Proportion 0.294 0.163 0.087 0.107 0.048 0.279 0.978
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
999 1 528 0.973 464.1 497.8 0 0 0 400 773 1101 1321
Good Living Quarters Average Living Quarters Average Rec Room Below Average Living Quarters Low Quality Unfinished Total
Frequency 11 20 24 29 31 863 978
Proportion 0.011 0.020 0.024 0.029 0.031 0.863 0.978
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
999 1 105 0.307 48.07 89.49 0.0 0.0 0.0 0.0 0.0 108.0 405.6
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
999 1 605 0.999 547 469.6 0.0 54.8 223.5 461.0 783.0 1183.0 1422.2
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
999 1 593 1 1059 464.2 483.0 609.6 797.5 998.0 1301.0 1614.6 1838.0
GasA GasW Grav OthW Wall Total
Frequency 988 8 2 1 1 1000
Proportion 0.988 0.008 0.002 0.001 0.001 1.000
Excellent Good Average/Typical Fair Poor Total
Frequency 516 157 304 22 1 1000
Proportion 0.516 0.157 0.304 0.022 0.001 1.000
N Y Total
Frequency 55 945 1000
Proportion 0.055 0.945 1.000
Standard Average Fair Poor Total
Frequency 932 54 12 2 1000
Proportion 0.932 0.054 0.012 0.002 1.000
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 624 1 1157 422.7 639.5 746.7 876.2 1080.5 1376.2 1696.0 1866.1
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 287 0.782 315.2 427.6 0.0 0.0 0.0 0.0 688.2 917.0 1116.2
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 12 0.036 4.32 8.584 0 0 0 0 0 0 0
n missing distinct Info Mean Gmd
999 1 4 0.744 0.4474 0.5227
n missing distinct Info Mean Gmd
999 1 3 0.167 0.06106 0.1153
n missing distinct Info Mean Gmd
1000 0 5 0.764 1.541 0.5427
n missing distinct Info Mean Gmd
1000 0 3 0.703 0.378 0.4811
n missing distinct Info Mean Gmd
1000 0 7 0.826 2.806 0.8418
n missing distinct Info Mean Gmd
1000 0 3 0.118 1.039 0.07888
Excellent Good Average/Typical Fair Poor Total
Frequency 67 403 509 20 1 1000
Proportion 0.067 0.403 0.509 0.020 0.001 1.000
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 12 0.953 6.34 1.698 4 5 5 6 7 8 9
Typical Minor Deductions 1 Moderage Deductions Major Deductions 1 Salvage Only Total
Frequency 935 42 16 6 1 1000
Proportion 0.935 0.042 0.016 0.006 0.001 1.000
n missing distinct Info Mean Gmd
1000 0 5 0.804 0.597 0.6627
Excellent Good Typical Fair Poor Total
Frequency 16 232 219 24 18 509
Proportion 0.016 0.232 0.219 0.024 0.018 0.509
2Types Attchd Basment BuiltIn CarPort Detchd Total
Frequency 10 610 11 56 1 266 954
Proportion 0.010 0.610 0.011 0.056 0.001 0.266 0.954
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
952 48 93 0.999 1978 28.2 1926 1941 1961 1979 2002 2006 2007
Finished Rough Finished Unfinished Total
Frequency 247 278 427 952
Proportion 0.247 0.278 0.427 0.952
n missing distinct Info Mean Gmd
999 1 6 0.828 1.767 0.7892
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
999 1 370 1 475.4 239.9 178.4 240.0 312.0 480.0 576.0 766.0 864.1
Excellent Good Typical Fair Poor Total
Frequency 1 7 904 37 3 952
Proportion 0.001 0.007 0.904 0.037 0.003 0.952
Excellent Good Typical Fair Poor Total
Frequency 1 6 918 21 6 952
Proportion 0.001 0.006 0.918 0.021 0.006 0.952
Paved Partial Pavement Dirt/Gravel Total
Frequency 904 29 67 1000
Proportion 0.904 0.029 0.067 1.000
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 215 0.862 93.84 123.1 0.0 0.0 0.0 0.0 168.0 250.2 312.0
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 176 0.916 48.93 64.9 0 0 0 28 74 132 191
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 101 0.42 23.48 41.46 0.0 0.0 0.0 0.0 0.0 115.0 169.1
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 15 0.041 3.118 6.177 0 0 0 0 0 0 0
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 57 0.219 14.77 27.7 0.0 0.0 0.0 0.0 0.0 0.0 153.1
n missing distinct Info Mean Gmd
1000 0 4 0.009 1.463 2.923
Excellent Good Fair Total
Frequency 1 1 1 3
Proportion 0.001 0.001 0.001 0.003
Good Privacy Minimum Privacy Good Wood Minimum Wood/Wire Total
Frequency 43 120 37 2 202
Proportion 0.043 0.120 0.037 0.002 0.202
Gar2 Othr Shed TenC Total
Frequency 2 1 25 1 29
Proportion 0.002 0.001 0.025 0.001 0.029
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 18 0.085 45.81 90.64 0 0 0 0 0 0 0
n missing distinct Info Mean Gmd .05 .10 .25 .50 .75 .90 .95
1000 0 12 0.987 6.243 3.108 2 3 4 6 8 10 11
n missing distinct Info Mean Gmd
1000 0 5 0.954 2008 1.469
COD Con ConLD ConLI ConLw CWD New Oth VWD WD Total
Frequency 27 5 7 5 6 3 79 4 1 863 1000
Proportion 0.027 0.005 0.007 0.005 0.006 0.003 0.079 0.004 0.001 0.863 1.000
Abnorml AdjLand Alloca Family Normal Partial Total
Frequency 61 2 4 17 834 82 1000
Proportion 0.061 0.002 0.004 0.017 0.834 0.082 1.000

Preprocessing

Create log transformations the following variables 1. Area 2. Price 3. Lot Frontage 4. Lot Area

  1. Area: Create log transformation
  2. Price: Crate

2018-08-04